THE HOFER CONJECTURE ON EMBEDDING SYMPLECTIC 

ELLIPSOIDS 



DUSA MCDUFF 

Abstract. In this note we show that one open 4-dimensional ellipsoid embeds sym- 
plectically into another if and only the ECH capacities of the first are no larger than 
those of the second. This proves a conjecture due to Hofer. The argument uses 
the equivalence of the ellipsoidal embedding problem with a ball embedding problem 
that was recently established by McDuff. Its method is inspired by Hutchings' recent 
results on embedded contact homology (ECH) capacities but does not use them. 



1. Introduction 

Consider the ellipsoid E(a, b) := {z 6 C 2 : ^ — \- ^j- < 1} with the symplectic 
structure induced from the standard structure on Euclidean space. Define AT (a, b) to 
be the sequence of numbers formed by arranging all the positive integer combinations 
ma + nb,m,n > 0, in nondecreasing order (with repetitions). We say that M(a,b) is 
less than or equal to Af(c, d) (written Af(a, b) =^ Af(c, d)) if, for all k > 0, the /cth entry 
of M(a,b) is at most equal to the kth entry in Af(c,d). Hofer's conjecture evolved as 
earlier guesses, such as those by Cieliebak, Hofer, Latschev and Schlenk in [3], proved 
inadequate. Finally, in private conversation, he conjectured that the numbers Af(a, b) 
should detect precisely when these embeddings exist. 

We show in this note that this is indeed the case. 

Theorem 1.1. There is a symplectic embedding mtE(a,b) A- E(c,d) exactly if 

M(a,b) =4AT(c,d). 

Using embedded contact homology (ECH), Hutchings showed in [6] that the condi- 
tion Af(a, b) =4 A/"(c, d) is necessary. In fact, the main result of his paper is that there 
are quantities called ECH capacities, defined for any closed bounded subset of M 4 , 
that are monotone under symplectic embeddings. The application to embedding ellip- 
soids then follows because the ECH capacities of E(a, b) are just the sequence Af(a, b). 
Hutchings also shows that his ECH capacities give sharp obstructions to the problem 
of embedding a union of disjoint balls into a ball. 
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In the case of embedding an ellipsoid into a ball, McDuff-Schlenk [14} Thm. 1.1.3] 
calculated exactly when the embedding exists, and concluded that in this case the 
condition M{a, b) =^ A/"(c, d) is sufficient. Combining these results, we see that Hofer's 
conjecture holds when the target is a ball. Below we prove the result in general by 
a much shorter argument that uses none of the geometric results in ECH. Instead it 
uses some elementary combinatorics that develop some of Hutchings' ideas, as well as 
the result from McDuff [12] that reduces the ellipsoidal embedding problem to a ball 
embedding problem. See Hutchings [7] for a survey that gives more of the background. 

The higher dimensional analog of Theorem |1.1| is completely open; there is even no 
good guess of what the answer should be. However the analog of the Hofer conjecture 
does not hold. The first counterexamples are due to Guth |3] who showed that there 
are constants a, 6, c such that E(1,R,R) embeds symplectically in E(a,b,cR 2 ) for all 
R > 0, with similar results in higher dimensions. In [5] Hind-Kerman improved Guth's 
embedding method to show that E(l, R, . . . , R) embeds in E(a, a, R 2 , . . . , R 2 ) whenever 
a > 3, but found an obstruction when a < 3. 

Example 1.2. (i) The sequence A/*( a, a) is 

J\f(a, a) = (0, a, a, 2a, 2a, 2a, 3a, 3a, 3a, 3a, . . . ). 

In other words, for each d there are d + 1 entries of da occurring as the terms A4(a, a) 
for \{d 2 + d) < k < \{d 2 + 3d). Thus J\f(a, a) is the maximal sequence with Mh = da 
for k = \{d 2 + 2>d). 

(ii) When k = \{d 2 + 3d), the sequence 

AT(1,4) = (0,1,2,3,4,4,5,5,6,6,7,7,8,8,8,...) 

has A/fe(l, 4) = 2d = A4(2, 2). Hence the maximality of J\f(2, 2) implies that jV(l, 4) ^ 

AA(2,2). Thus int E(l, 4) A 73(2). (Here, and elsewhere, we write xAy to mean that 
X embeds symplectically in Y.) 

The first construction of an embedding of this kind is due to Opshtein [16]^] The 
paper [12] develops a general method of embedding ellipsoids, which in most cases is 
not very geometric. However, as is shown in [f2l §1], in the special case of £7(1,4) the 
argument can be made rather explicit. One still cannot see the geometry of the image 
as clearly as in Opshtein because one uses symplectic inflation to increase the size of 
the image of the ellipsoid, i.e. rather than embedding larger and larger ellipsoids into 
a fixed ball, one embeds a small ellipsoid E into the ball B and then increases the 
relative size of E by distorting the symplectic form on B\E. 

Remark 1.3. (i) We phrase all our results in terms of embedding the interior of X 
into Y (or, equivalently, embedding intX into intY), while Hutchings talks about 
embedding X into the interior of Y. But these amount to the same when X is the 



In fact he constructed an explicit embedding from inti?(l,4) into projective space by using prop- 
erties of neighborhoods of curves of degree 2, but one can easily arrange that the embedding avoids a 
line so that there is a corresponding embedding into a ball. Cf. also Theorem 4 in [17] . 
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disjoint union of ellipsoids, since in this case there is a symplectic embedding int X^->Y 
exactly if XX embeds symplectically into intY for all A < 1; cf. \12\ Cor. 1.5]. 

(ii) The current methods extend to give a simple numerical criterion for embedding 



disjoint unions of ellipsoids into an ellipsoid. See Proposition |3.5| for a precise result. 
Also, all the methods used here extend to the case when the target manifold is a 
polydisc, i.e. a product of two discs with a product form, cf. Miiller [15J, or a blowup 
of a rational or ruled surface. However, just as in [TU], one gets no information when 
the target is a closed 4-manifold with > 1 such as T 4 or a surface of general type. 

See Bauer pQ for more information on the numerical properties of the sequences 
Af(a,b). 

Acknowledgements I warmly thank Michael Hutchings for his patient explanations 
of the index calculations in ECH and for many illuminating discussions, Felix Schlenk 
for some useful comments on an earlier version of this note, and also FIM at ETH, 
Zurich, for providing a very stimulating environment at Edifest, 2010. 

2. Combinatorics 

A standard continuity argument implies that it suffices to prove Theorem |1.1| when 
the ratios b/a and d/c are rational. One of the main results of |12j states that for 
each integral ellipsoid E(a,b) there is a sequence of integers W(a, b) := (Wi, . . . , Wm) 
called the (normalized) weight sequence of a, b, such that int E(a, b) embeds in B(fi) 
exactly if the disjoint union Uint-B(W) := U int B(Wi) embeds in B(fi). This section 
shows how J\f(a, b) may be calculated in terms of W(a, b). 

We begin with some definitions. They are basically taken from [T2], but are modified 
as in [T3]. Given positive integers p, q with q < p we denote by W(p, q) = ~W(q,p) 
the normalized weight sequence of p/q. Thus W(p, q) = (W±, W2, • • • , Wm) is a finite 
sequence of positive integers defined recursively by the following rulesj^] 

• W\ = q and W n > W n +i > for all n; 

• if Wi > Wi + i = • • • = W n (where we set Wq := p), then 

W n+ i = minjWn, Wi - {n- i)W i+1 }\ 

• the sequence stops at Wm if the above formula gives Wm+i = 0. 
It is often convenient to write W(p, q) as 

(2.1) W(p,q) = {xf\Xf\. . .,Xgx), 

where Xq > X\ > • • • > X% > and Ik > 2. Thus the ii are the multiplicities of the 
entries in W(p, q) and, as is well known, give the continued fraction expansion of p/q: 

2 The proof that the weight expansion W(p, q) described above agrees with that used in [15] is given 
in the appendix to [14] . 
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Figure 2.1. One obtains the weights by cutting a rectangle into 



squares: W(5, 12) = (5, 5, 2, 2, 1, 1) =: (5 x2 , 2 x2 , l x2 ^ 



namely 
(2.2) 



4 + 



In this notation, the defining formulas for the terms in W(p, g) become: 

X_! := p, X = g, = Xi-t - iiXi, i > 0. 

In particular, because X\ = p — £$q, 



(2.3) 



W(p,q) 



,X<0 



Xi2 



{q^,W(q,X l )). 



More generally, the following holds. 



Lemma 2.1. Define p,q,Xi,£i,0 < i < K as in equations (2.1) and (2.2) and set 
X-i := p, £-\ := 1. Then for all i = 0, . . . , K — 1, we have 

WpQ-i,^) = (l^.W^Ij+i)). 

Remark 2.2. The weight sequence W(p, g) does not seem to be mentioned in el- 
ementary treatments of continued fractions. Instead, one considers the convergents 
Pk/qk '■= [^o;^i, • • • ,@k] of p/q. However, it is well known that the two mirror fractions 
p/q := [£o]£i, ■ ■ ■ ,£k] and P/Q := [£k',^k-i, ■ • • ,£o] have the same numerator p = P. 
It follows that when p, q are relatively prime the sequence 

(Xo,Xi, . . . ,X K ), 

when taken in reverse order, is just the sequence of numerators of the convergents of the 
mirror P/Q. More precisely, Xk—u = Pk-i f° r 1 < k < K + 1. Further, the rectangle 
definition of the W« (cf. Figure 2.1) easily implies that 'YlnWf = pq. Another, less 



obvious, quadratic relation for the Wj is discussed in [14, §2.2]. 

We now show that M(a, b) may be calculated from the weights W(a, b) using an 
operation # first considered by Hutchings. Given two nondecreasing sequences C = 
{C k ) k >Q,V= (V k ) k > with Cq = Vq = 0, define C#V by 

(C#V) k := max (d + V k _i). 

0<i<k 

The next result follows immediately from the definition. 

Lemma 2.3. (i) The operation C,T> i— > C#T> is associative and commutative. 
(ii) If C =4 C then C#T> =4 C'#T> for all sequences T>. 
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For any sequence a = (ai, . . . , ant) of positive integers, we define 
Af(a) := Af(ai,ai)#Af(a 2 ,a 2 )# ■ ■ . #AA(a M , a M ). 

If a := a% = ■ ■ ■ = clm we abbreviate this product as # M AA(a, a). 

To understand the effect of the operation # on sequences of the form jV(a, 6), it is 
convenient to interpret the numbers A/fc(ct, 6) in terms of lattice counting as in Hutchings 
(6J §3.3]. For A > consider the triangle 

T^ b := {{x, y) G M 2 : x, y > 0, ax + by < A}. 

Each integer point (m, n) G T^ b gives rise to an element of the sequence N(a, b) that 
is < A. If a/b is irrational there is for all A at most one integer point on the slant edge 
of T^ b . It follows that J\f k (a, b) = A, where A is such that \T^ b n Z 2 | = fc + 1. Since for 
rational a/6 there might be more than one integral point on this slant edge, the general 
definition is: 

M k (a, b) = \ni{A : \T^ b n 1?\ > k + l}. 




(ID 



Figure 2.2. Different cuts in Lemma [2~4l 



Lemma 2.4. For aZZ a, 6 > 0, we have Af(a, a)#AA(a, 6) = J\f(a, a + b). More generally, 
for all £ > I, we have (#W(a, a))#AA(a, b) = Mifl, b + ta). 

Proof. By continuity and scaling, it suffices to prove this when a, b G Z. Suppose that 
A/fc(a,a + 6) = A Then there is at le ast o ne integer point (m,n) on the slant edge 



QP of the triangle T := T^ a+fc ; see Fig 2.2 (I). Let d := |~^] be the smallest integer 
greater than ^ b , and let R be the point where the line joining Y = (0, d) to X = (d, 0) 
meets the slant edge of T. Then because the intersection of the line x + y = d — 1 with 
the first quadrant lies entirely in T all the points in the interior of triangle OXY lie in 
T. Hence S := T Pi Z 2 divides into two sets 5i and 52, where Si contains X plus all 
points (m, n) in T with m + n < d, and 52 consists of all other points in S. Let 

|5i| = ki + 1, |5 2 | = fo- 
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Then k\ + k<i = k, and our remark above about the triangle OXY implies M^ia, a) = 
da. 

Let a be the integral affine transformation that fixes Y and translates the x axis by 
— d so that X goes to the origin O. Then a takes the triangle XRP to the triangle 
T^ b , where B/a = A/a — d. The set of integral points in T^ b is a(S2 U {X}). Hence 
A/fc 2 (a, b) = B = A — da. Thus Nk x (a, a) + A4 2 (a 5 b) = A4(a, a + b). 

We claim that for all other i < k we have A/i(a, a) + Afk-i(a, b) < A4(a, a + b). To see 
this, we slightly modify the above argument as follows. If i > ki then A/i(a, a) = ad' 
for some d' > d. Decompose T by the line x + y = d' as above. Then choose S[ C 5 to 
contain X' = (fi',0) together with all points in T with x + y < d' and let 5^ = S\S[. 
Then k[ + 1 := < z + 1 (since now there may be some integer points in the interior 
of triangle QY'R') so that A/"^(a, a) < ad', while, as above, A4-fc^(o, b) = A — ad' . 

If i < k% then we choose d' as before and again slice T by the line x + y = d' . The 



corresponding triangle OX'Y' is illustrated in Figure 2.2 (II). The line X'Y' now meets 
the slant edge of T at R' lying beyond Q. Hence, if we partition the integral points in 
T as before, k' 2 + 1 := IS2I + 1 is at most the number £ + 1 of integral points in X'R'P 
(and may well be strictly smaller). Thus My (o> b) < Mg(a, b) = A — ad' . On the other 
hand, A/"^(a, a) = ad' as before. 

This completes the proof of the first statement. The second follows immediately by 
induction. □ 

Corollary 2.5. Let W(a, b) be the weight sequence for (a, b) G N 2 . Then Af(W(a,b)) 
= M(a,b). 

Proof. Without loss of generality, we may suppose that a, b are relatively prime and 
that a < b. If a = 1 then JV(1, 0) = # b N(l, 1) by the second statement in Lemma 



2.4 



For a general pair (a, 6) we argue by induction on the length K of the weight 
expansion W(o,6) = (Xq £ " , . . . ,X i £). We saw in above that X x = b - £ a 



and that W(Xi,a) = (Xj* 1 , . . . Therefore, we may assume by induction that 

Af(W(Xi,a)) =N{X 1 ,a). Thus 

Af(W(a,b)) = (#*>JV(o,o))#AT(W(Jfi,o)) 

= {# to N(a,a))#N{X u a) 

= N{a,X 1 +£ {i a)=N{a,b), 

where the first equality follows from the definition since W(o, b)) = (a x ^°, W(Xi, a)) , 
the second holds by the inductive hypothesis, and the third by Lemma |2.4| □ 



The first part of the next lemma was independently observed by David Bauer during 
Edifest 2010. 

Lemma 2.6. (i) Given integers < a < b we have Af(a, b)#N{b — a,b) < J\f(b, b) . 
(ii) For each k > 1 there is i such that Nk+i{b, b) = A4(a, b) + Me(b — a, b). 

Proof, (i) Since A/i(6, b) is constant and equal to bd on the sets of the form (d 2 + d)/2 < 
i < (cZ 2 + 3d) /2, it suffices to consider the case i = (d 2 + 2>d)/2. Then the points counted 
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(0,d)=Q 




P=(d,0) 




(ID 



Figure 2.3. Decompositions considered in Lemma 2.6 



by Afi(b,b) are those in the triangle T = OPQ in diagram (I) in Figure 2.3. Cut this 
triangle by the line QX of slope —b/a, and divide the integer points in T into two 
groups Si , 52 where 5i consists of Q plus the points to the left of QX and 52 is the 
rest. Let |5i| = k\ + 1 and |52| = &2 as before. Then, counting the points in triangle 
OQX = T^, we see that A4 1 = ad. Further, if we move the triangle PXQ first by 
the affine transformation that fixes the x-axis and takes Q horizontally to the point 
(d, d) and then by the reflection in the vertical line x = d, it is a horizontal translate of 
^bf-a Hence counting the points in the triangle PQX, we see that A4 2 = [b — o)d. 
Thus J\f kl + Nk 2 = bd. This proves (i). 

To prove (ii), suppose A4(a, b) = B and let R be an integer point on the slant edge 
of the corresponding triangle TP . Let Y be the point where the slant edge of this 
triangle meets the y-axis, so that Y = (0, B/a), and let X be where it meets the x-axis, 
so that X = (B/b,0). Next, let Q = (0,d) be the integer point with d = [B/a\, and 
put P = (d, 0) as before. Then no points in the triangle T^ a lie above the line PQ since 
\YQ\ < 1. We now divide the points in triangle OPQ into two sets as before, with 5i 
the union of R with all points to the left of XY and 52 the rest. Then |5i| = k + 1 
by construction. Further, as in (i), the points in 52 lie in a triangle that is affine 
equivalent to T^ b _ a , where A/b = \PX\ = d - B/b. Mor eover 52 contains P (which 
corresponds to the origin) but not R, which is a point on the slant edge. Thus T^ b _ a 
has |52| + 1 integer points, so that, if £ := |<S2 1 we have Me(b,b — a) = A = db — B. 
Therefore A4(a, b) +Ng(b, b — a) = db. But k + 1 = (d 2 + 3d)/ 2 by construction, so that 
Mk+t{b,b) = db. The result follows. □ 

Corollary 2.7. Let C be a nondecreasing sequence of nonnegative numbers such that 
C#JV(d- c,d) <M(d,d). ThenC <M(c,d). 
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Proof. If not, there is k such that Ck > A4(c, d). But by the previous lemma, there is 
£ such that Nk+i{d, d) = A/fc(c, d) +Nt(d — c, d). Then Ck + J\fg(d - c, d) > Mk+i{d, d) 
contradicting the hypothesis. □ 

3. Proof of Theorem 11.11 

Our argument is based on the following key results. 

Proposition 3.1. [121 Thm. 3.11] Let c,d,e,f be any positive integers and A > 0. 
Then there is a symplectic embedding 

§e ■ 'mt XE(e,f) A int E(c,d) 
if and only if there is a symplectic embedding 

<S>b- intA5(W(e,/)) U int B(W(d - c, d)) 4 B{d). 

The proof that the numerical condition is sufficient for an embedding to exist involves 
a significant use of Taubes-Seiberg-Witten theory in conjunction with the theory of 
J-holomorphic curves. However it is easy to see why it is necessary, since the ellipsoid 
E(p, q) decomposes into a union of balls whose sizes are given by the weights W(p, q); 



cf Figure 3.1 To see this, recall that the moment (or toric) image of the ball is affine 
equivalent to a standard triangle (a right-angled isosceles triangle), while that of an 
ellipsoid is an arbitrary right-angled triangle. As the diagram shows, the decomposition 



of a rectangle into squares given by the weights (as in Figure 2.1 ) yields a corresponding 
decomposition of the rectangle into (affine) standard triangles; see §2] for more 
detail. 




Figure 3.1. Cutting a triangle into standard triangles. 



The next result follows from Hutchings' observation in [6j Remark 1.10] that the 
ECH capacities N(a, b) give a sharp obstruction for ball embeddings. We give the 
proof to make it clear that it does not use any knowledge of ECH although it does use 
easier gauge theoretic results. 

Proposition 3.2. [6] Let a := (oi,...,om) be any sequence of positive numbers. 
Then there is a symplectic embedding '■ Ujint B(ai)^->B(n) exactly if A/"(a) := 
#iftf{ai,ai) 4 N{n,n). 
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Proof. Hutchings showed in [6j Proposition 1.9] that A/"(a) =^ AA(/i, /i) exactly if 
(3.1) rriiOi < /j,d, whenever m 2 + rrij < d 2 + 3d, 

i i 

where (d; mi, . . . , tom) = : (d; m) is any sequence of nonnegative integers. This holds 
because the kth entry in the sequence AA(a) = #f£ 1 AA(aj, aj) is the maximum of the 
numbers 



where ^ hi = k and ^(m? + mi) <ki< \{m 2 + 3m,). 

On the other hand it was shown by McDuff-Polterovich [13] that &b exists exactly 
if for each A G (0, 1) there is a symplectic representative of the cohomology class 
a\ := fi£ — X 2 Yji o-i&i on the M-fold blowup X M ■= CP 2 #MCP 2 that is deformation 
equivalent to the small blowup of a form on CP 2 and hence has standard first Chern 
class. (Here £ and e%, i = 1, . . . , M, denote the Poincare duals of the classes of the line 
L and the exceptional divisors E{.) Thus we need a\ to lie in the symplectic cone Ck 
of Xm given by the classes of all symplectic forms with first Chern class Poincare dual 
to -K:=3L 

After preliminary work by McDuff [10J and Biran [2 J concerning the closure of Ck, 
Li-Liu in Theorem 3] used Taubes-Seiberg-Witten theory to describe Ck in the 
following terms. Let £k C Hi{Xw,7l) be the set of classes E with —K ■ E = 1 that 
can be represented by smoothly embedded spheres of self-intersection —1. Then Li-Liu 
showed that Ck is connected and has the following description: 

C K = {a£ H 2 (X M ;^) : a 2 > 0, {a, E)>0V£e £ K }- 

Notice that if E = dL — ^rriiEi G £k then ^m 2 = d 2 + 1 while Yl m i = 3d — 1. 
Therefore each such E does give rise to an inequality of the type considered in (|3.1[ )p| 

It To »-il<rii"v r\ricnr T/~i nl~t r\r»\r rhr\r r n rv in/iniin irinn in I -i I 1 fnv W ^ /-v^-\ n vn t "\ I t r f h nr \ «r i i 



It is also easy to check that the inequalities in (3.1 ) for d — > oo imply that ^2a 2 < fi, 
which corresponds to the volume condition a 2 > 0; cf. [6, Remark 3.13]. However, 
because many tuples (d; m) with ^ m? + m; < d 2 + 3d do not correspond to elements 



in £k, it seems on the face of it that the conditions in (3.1) are more stringent than 



the geometric condition a G Ck- Lemma 3.3 below gives a purely algebraic argument 



showing that this is not the case. □ 

Let T be the set of all tuples (d; m) := (d; mi, . . . , vtlm) such that X^i( m f + m «) — 
d 2 + 3d. We denote by J- + those elements (d; m) G J- with d > and m; > for all i. 
Further we define (n; a)-(d; m) := d/x— ^ cafni =: d/i— a-m. Then, if —K = (3; 1, . . . , 1), 
we have 

(d; m) G T ^=> (d; m) • ((d; m) - K) > 0. 
^ The cone Ck is described by strict inequalities, but when we let A — » 1 these correspond to the < 



signs in (3.1 1 
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Similarly, we identify the cone Ck C H 2 (Xm',^) with the set of tuples (//; a) given by 
the coefficients of the classes a = [a£ — Yl a i e i 6 Ck- In this notation, it suffices to 
prove the following lemma. 

Lemma 3.3. Let (/x; a) := (/x; ai, . . . , a^v/) be a tuple of nonnegative numbers such that 
(i) (/x; a) • (d; m) > for all (d; m) G 

,2 



(ii) ll a ll : = yEi^ < n. 

Then (/x; a) • (d; m) > /or a// (d; m) G J r+ . 

The proof is based on the following elementary result, that is part of Li-Li (8J 
Lemma 3.4]. We say that a tuple (/x;a) is positive if /x > and > for all i; 
that it is ordered if a\ > • • • > ajvf ; and that a positive, ordered (/x; a) is reduced if 

/x > ai + a>2 + 03. 

Lemma 3.4. Let (/j; a) 5e reduced and (d; m) 6e a positive tuple such that —K-(d; m) = 
3d — ^2 mi > 0. TTjen (/x; a) • (d; m) > 0. 

Proof. Because mj < 3d, we may partition the list 

jjxrai 2 xm 2 ^j-xmu\ 

(considered with multiplicities) into sets J n , 1 < n < d, where each I n = {j n \, j n 2, jnz} 
is a set of at most three distinct numbers chosen so that each element j G {1, . . . , M} 
occurs in precisely rrij different sets I n . Then 

M d 
i=l n=l ieln 

Further J2iei n a i — I 1 f° r an In because (/x; a) is reduced. Hence 

M d 

y~] aiiui <y~]n = dp, 

i=l n=l 

as required. □ 

Proof of Lemma 3.3[ In the argument below we assume M > 3. Since we allow the 
ai to be 0, we can always reduce to this case by increasing M if necessary. Next observe 
that it suffices to prove the result for integral tuples (/x;a) and (d;m). We suppose 
throughout that (/x;a) satisfies conditions (i) and (ii). Further, if (d; m) is such that 



ll m ll := yE m i — ^ th en (d; m) • (/x;a) > d/x — ||m|| ||a|| > as required. Therefore 
we only need consider (d; m) with (d; m) • (d; m) < 0. 

Following Li-Li [8j, consider the Cremona transformation Cr that acts on tuples by 
Cr(d; m) = (d'; m'), where m'j = mj for j > 4 and 

d' = 2d — (mi + 771,2 + "13)5 m\ = d— (mj + m^) for {i,j, k} = {1, 2, 3}. 

Then Cr preserves the class K and the intersection product, and hence preserves T . 
Because Cr, when considered as acting on H2(Xm), is induced by a diffeomorphism 
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(the reflection in the sphere in class L — E\ — E2 — E3), it preserves the set of classes 
represented by embedded spheres and hence preserves Ek and Ck ■ 

Now suppose that (/i; a) 6 Ck, and denote by Orb (fj,; a) its orbit under permutations 
and Cremona transformations. Since Orb (/x; a) C Ck, all elements in Orb (fi; a) are 
positive. Moreover, if (fi; a) is ordered, a Cremona move decreases fi unless (//; a) 
is also reduced. Hence we can transform an ordered (//; a) to a reduced element 
(//; a') := C(//; a) by a sequence of k moves C%, . . . , C& each consisting of Cr followed 
by a reordering. Thus C := Cj. o ■ • ■ oC\. Take any (d; m) G with (d; m) • (d; m) < 
and denote by (d'; m') := C(d; m) its image under these moves. Then we must check 
that (d';m') • (//;a') > 0. 

There are three cases to consider. 

Case (i) (d';m') is positive. 

Since (d';m') ■ (d';m') = (d;m) ■ (d;m) < and (d';ia!) £ J 7 we must have —K 
(d'; m') > 0. Hence the result follows from Lemma 



3.4 



Case (ii) d' > but some < 0. 

In this case, let (d', m") be the positive tuple obtained from (d 1 ; m') by replacing the 
negative terms by 0. Because m 2 + m > for all m, we still have (cf ; m") £ F + . 
Further 

(<f ; m') • (p!; a') > (d'\ m") • (//; a'). 

Therefore it suffices to show that (d';m") • (//;a') > 0. If (d'jm") • (d'; m") > 0, then 
this holds by the argument in the first paragraph of this proof. Otherwise (d';m") ■ 
(d'; m") < 0, and it holds as in case (i) above. 

Case (iii) d' < 0. 

In this case, we show that (d';m') • (/x';a') > by induction on k, the length of 
the reducing sequence for (/i; a). Consider the sequence (df, m^) := Ci o ■ ■ ■ o Ci(d; m) 
of elements of T obtained by applying the moves Ci, i = 1, . . . , k, to (d; m), and let 
a^) be the corresponding elements of Ck- Consider the smallest £ for which (c^; m^) 
is negative. 

Suppose first that dg < 0. Then the entries Sj := of the previous term rri£_i 

are nonnegative, while, if t := cZ^_i we have 

3 

< 2t < si + s 2 + s 3 , ^ s, 2 + Si < t 2 + 3t. 



i=i 



Therefore, if sj = Xt 2 with A > 1, we have 



3 

2t < J^Si < 3t-(X-l)t 2 , 

8=1 



so that A < 1 + 1/t. Thus in all cases Ya=i s i < t(l + 1). But the minimum of the 
expression x 2 + y 2 + -z 2 subject to the constraints x, y, z > 0, x + y + z = 2 is assumed 
when x = y = z and is |. Therefore t(l + i) > |t 2 , which gives c?^_i = t < 3. But 
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there is no integral solution for (c^_i; ni£-i) with such a low value for di-\. Thus this 
case does not occur. 

Hence the first negative element (df, m^) must have dg > and some negative entry 
in m£. But then define m" as in Case (ii) by replacing all negative entries in by 0. 
We saw there that it suffices to show that (dp ; m") • (/^; a^) > 0. If (dg; m") • (df, m") > 
this is automatically true. Otherwise, since £ > 1, it holds by the inductive hypothesis. 

□ 



This completes the proof of Lemma 3.3 



We are now ready to prove the main result. We denote by XE(a, b) the ellipsoid 
{Ax : x S E(a, b)}. Thus XE(a, b) = E(X 2 a, \ 2 b) has corresponding sequence X 2 M(a, b). 



Proof of Theorem By standard continuity properties as explained in \12\ 

Cor. 1,5], it suffices to prove this when a,b,c,d are rational. Therefore we will sup- 
pose that c < d are mutually prime integers and that (a, b) = X 2 (e,f) where e < / 
are also mutually prime integers. We need to show that there is an embedding <3?£ : 
int XE(e, /)4#(c, d) exactly if A 2 W(e, /) M(c, d). 

By Proposition |3. 1| it suffices to consider the corresponding ball embedding an d 
by Proposition |3.2[ this exists exactly if 



J\f(\B(W(e, f)) U B (W (d, d — c))) 4 M(B(d)) = M(d,d). 

Since M(XB(W(e, /)) U B(W(d, d - c))) = A 2 W(e, f)#N{d, d - c) this condition is 
equivalent to 

(*) X 2 N(e,f)#N(d,d-c) 4 M(d,d). 
Thus the theorem will hold if we show that (*) is equivalent to the condition X 2 N(e, /) =^ 
M(c,d). 

But if A 2 W(e, /) M(c, d) then we have 

A 2 W(e, f)#M(d, d-c)^ AA(c, d)#N(d, d - c) ^ J\f(d, d) 



by Lemma 2.3 (ii) and Lemma |2.6| (i). Conversely suppose that (*) holds. Then 



X~N(e, f) =4 A' (c, d) by Corollary 2.7 Hence the two conditions are equivalent. □ 



Proposition 3.5. The disjoint union of open ellipsoids \Jf =1 E(ai,bi) embeds symplec- 
tically in E(c, d) exactly if 

AT(oi, 6i)# . . . #M(a n , b n ) 4 M(c, d). 



Proof. Though this case is not considered in |12j . the proof of Proposition 3.1 works 
just as well when the domain is a disjoint union of ellipsoids. Hence, if c < d the 
necessary and sufficient condition for this embedding of unions of ellipsoids to exist is 
that 

M(UiW(ai, 6 4 ))#AA(W(d - c, d)) 4 M(d, d). 
The proof of Corollary |2.5| adapts to show that 

^(UiWCoi, k)) = M( ai ,h)# . . . #M(a n , b n ) =: C. 
Now use Lemma |2.6| and Corollary |2.7| as before. □ 
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